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Abstract 

The relation between the galaxy correlation function in real and redshift-space 
is derived in the linear regime by an appropriate averaging of the joint probabil- 
ity distribution of density and velocity. The derivation recovers the familiar linear 
theory result on large scales but has the advantage of clearly revealing the depen- 
dence of the redshift distortions on the underlying peculiar velocity field; streaming 
motions give rise to distortions of 0(f2°'®/6) while variations in the anisotropic ve- 
locity dispersion yield terms of order 0{p}'^ jlP'). This probabilistic derivation of the 
redshift-space correlation function is similar in spirit to the derivation of the com- 
monly used "streaming" model, in which the distortions are given by a convolution of 
the real-space correlation function with a velocity distribution function. The stream- 
ing model is often used to model the redshift-space correlation function on small, 
highly non-linear, scales. There have been claims in the literature, however, that the 
streaming model is not valid in the linear regime. Our analysis confirms this claim, 
but we show that the streaming model can be made consistent with linear theory 
provided that the model for the streaming has the functional form predicted by linear 
theory and that velocity distribution is chosen to be a Gaussian with the correct 
linear theory dispersion. 



1 Introduction 



Traditionally the two-point correlation function has been the most popular statistic for 
quantifying galaxy clustering. The correlation function is usually measured from a galaxy 
redshift catalog by computing pairs in redshift-space. The resulting redshift-space correla- 
tion function, r^), is a function of pair separations both parallel (r^) and perpendicular 
(rcr) to the observer's line of sight; it is an anisotropic function that systematically differs 
from the real-space correlation function, ^(r). On small scales, ^(r"a.,r^) is suppressed due 
to the elongation of non-linear structures along the line of sight, the familiar "Fingers of 
God." On large scales an opposite effect occurs: coherent in-fall into overdense regions 
and out-flow from underdense regions compresses structures in redshift-space thereby am- 
plifying ^(r"a.,r^). The redshift-space correlation function thus provides a valuable probe of 
the peculiar velocity fleld in both the non-linear and linear regimes. In the gravitational 
instability picture of structure formation, the amplitude and coherence of the velocity fleld 
is directly related to clustering of the underlying mass distribution; by quantifying the dis- 
tortions in the redshift-space clustering, we can hope to measure important cosmological 
parameters such as shape of the power spectrum on large scales and the mean mass density, 

n. 

There have been two models used in the literature to describe ^(r"a.,r^). The flrst and 
perhaps most well known model expresses ^(r"a.,r^) as a convolution of the real-space cor- 
relation function with the probability distribution for velocities along the line of sight 
+00 

C{r,,r.)= J dyi{T) 
—00 

where = y^ + r^ and fi2(r") is the mean relative peculiar velocity of two galaxies separated 
by r (cf., Peebles 1980, § 76; Peebles 1993, p. 478). F^[V) is the probabihty distribution of 
relative velocities about fi2(r") and is usually assumed to be isotropic with zero mean. By 
"isotropic" we mean that the tensor second moment of the velocity distribution, {viVj), is 
assumed to of the form, cr^Sfj [Sfj denotes the usual Kronecker delta symbol). Throughout 

■"^We will work in units where the Hubble constant is set to unity, i.e., velocities will have the same units 
as distances. 
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this Paper, the model for ^(r"a.,r^) given in Equation 1 with an isotropic dispersion will 
be referred to as the "streaming" model. The streaming model has mainly been used to 
estimate the galaxy pair- wise velocity dispersion from the suppression of ^(r"a.,r^) on small 
scales (e.g., Davis & Peebles 1983). 

The second formulation of the redshift-space correlation function is based on the linear 
theory of gravitational instability. In linear theory, the peculiar velocity field can be deter- 
mined directly from the density field; the mapping from real to redshift-space is unique and 
i{fcTif-K) is completely specified. The redshift-space correlation function in linear theory 
has been calculated by a number of authors (Kaiser 1987; Lilje & Efstathiou 1989; McGill 
1990; Hamilton 1992). The result can be compactly written as (Fisher, Scharf, & Lahav 
1994) 2; 

"4 „ 4 



2 ^ 1 , 
1 + 3/3+3/3^^ 



8 



+ 



35^ . 



3^+7^^. 



where /3 = 0,°'^ /b is the linear theory growth factor [b is the bias parameter which is assumed 
to be independent of scale), s' = r^+r^, and fi = The expansion coefficients are given 

by moments over the power spectrum, = ^ / k' P{k)ji[ks), where ji{x) and Vi{x) 
are the usual spherical Bessel function and Legendre polynomial, respectively, of order /. In 
introducing the bias factor b in Equation 2 we are defining the power spectrum, P{k), to be 
that of the galaxy distribution and not that of the underlying mass, i.e. P{k) = b'Pmassik). 
Consequently, the real-space galaxy correlation function in this notation is simply ^(r) = 
^o('")- Throughout this Paper, we will work with quantities of the galaxy distribution; thus 
6[r) will refer to fluctuations in the galaxy distribution and not the mass. Consequently 
factors of the bias parameter, 6, will arise when we relate velocities (which depend on the 
mass fluctuations) to P{k). The main interest in modeling ^(r"a.,r^) in the linear regime 
has been in the possibility of measuring /3 (e.g., Hamilton 1993). 

There has been considerable confusion about the compatibility of the two models for 



■^Equation 2 is strictly valid only if the galaxy pairs used to measure C{ra,rT^) subtend a small angle 
with respect to the observer (cf., Cole, Fisher, & Weinberg 1994; Zaroubi & Hofmann 1994); in this Paper 
we follow Kaiser (1987) and work in this "distant observer" limit. 
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i{fcT-,f-K) discussed above. There have been claims in the hterature that the streaming 
model given in Equation 1 fails to asymptote to the linear theory limit represented by 
Equation 2. Kaiser (1987) and McGill (1990) both pointed out that the streaming model 
fails to reproduce the terms of 0(/3^) in Equation 2. Fisher et al. (1994) emphasized that 
the usual formulation of the streaming model assumes a constant dispersion; they claimed 
that the missing terms could be recovered by generalizing the streaming model to include 
a scale-dependent dispersion which asymptotically approaches the shape expected in linear 
theory for large separations. 

The purpose of this Paper is to clarify the assumptions that enter the streaming model 
and show how the model can be made consistent with linear theory on large scales. The 
streaming model provides an accurate representation of the redshift distortions on small 
scales, while linear theory is appropriate on large scales. Understanding the relationship 
between the two approaches will be necessary to construct global models for ^(r"a.,r^) which 
are applicable on all scales; these models will be needed to if one wishes to take full 
advantage of the superior data that will be available in the next generation redshift surveys 
such as the Sloan Digital Sky Survey (Gunn & Knapp 1993; Gunn & Weinberg 1995) and 
the Anglo- Australian Telescope 2dF galaxy survey. 

The outline of this Paper is as follows. In § 2.1, we review the assumptions on which the 
streaming model is based. A basic inconsistency of the streaming model on large scales is 
its failure to properly model the density /velocity coupling inherent in linear theory; in § 2.2, 
we show how this can be rectified by using the the joint density and velocity probability 
distribution. In § 3, we compute the probability distribution in linear theory and, in § 4 
we show that it leads to a redshift-space correlation function which is consistent with the 
linear theory result given in Equation 2. The derivation reveals the origins of the linear 
theory distortions in Equation 2: the terms of 0(/3) are due to scale dependence of the 
mean streaming along the line of sight while the terms 0(/3^) are due to variations in the 
line of sight velocity dispersion. We conclude in § 5. 
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2 Probabilistic Interpretation of the Redshift Space 
Correlation Function 



2.1 Assumptions of the Streaming Model 

In order to see clearly the assumptions that enter the streaming model, it is helpful to recall 
the operational definition of the two-point correlation function in redshift-space, namely 
that ^(r"a.,r^) represents the excess probability (above Poisson) of finding a galaxy pair (at 
redshift positions, s and s') with separations and r^- parallel and perpendicular to the 
line of sight, 

dP = d^sd^s' [1 + C{r^,r^)] , (3) 

where n represents the mean density of galaxies. The streaming model. Equation 1, arises 
from the conjecture that this probability is equal to the following relationship between the 
real-space correlation function and velocity distribution function: 

dP = d^sd^s' [1 + i{r)] F,{V) S^'l^r^-y-^- v,,{r) - dV dy , (4) 

where = f'i ~\~ v'^ ■ The first term in Equation 4 is the Poisson contribution while ^(r) 
represents the excess probability of finding the galaxy pair in real-space due to clustering. 
The velocity distribution function and Dirac delta function convert the probability of finding 
separations in real-space to the the desired separations in redshift-space. By equating 
Equations 3 and 4 and integrating over the unobserved variables V and y one obtains the 
streaming model given in Equation 1. 

There are several points to be made regarding Equation 4: 

• The probability of finding the galaxy pair in real-space at separation r is assumed 
to be independent of the probability of having an associated relative velocity, V . 
This can be seen clearly in Equation 4 where there is an explicit factorization of the 
the probability of finding the galaxy pair, oc [1 + ^(r)], and the velocity distribution 
function, Fy{y). This factorization may be a reasonable approximation on highly 
non-linear scales where the streaming model is often applied, but as we will see in 
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§ 2.2, it is not valid in linear theory due to the coupling between the velocity and 
density fields. The streaming model partially compensates for this failure to model the 
density /velocity coupling by explicitly imposing a non-zero mean velocity streaming, 
'"i2{f), which would vanish in the complete absence of coupling between the density 
and velocity fields. This ad-hoc introduction of velocity/density coupling makes the 
streaming model suspect in the linear regime. 

• In the usual formulation of the streaming model, the second moment of F^[V), the 
velocity dispersion, is taken to be constant (or at most a slowly varying function of 
separation) and isotropic. However, linear theory predicts that velocities are spatially 
correlated with a scale-dependent anisotropic dispersion, again casting suspicion of 
the streaming model on large scales. 

• The form of F^[V) is arbitrary. However, in linear theory the distribution is uniquely 
specified once the statistical nature of the density field is known. In § 3, we will 
consider the case where the density field is a Gaussian random field. In this case, the 
joint distribution of densities and velocities is a multi-variate Gaussian. 

2.2 Accounting for Density /Velocity Coupling 

We now wish to modify Equation 4 to correctly account for any possible velocity/density 
coupling. This requires knowledge of the joint probability density of measuring both the 
number densities, 6[r) = n[r)/n — 1, and peculiar velocities along the line of sight at two 
real-space positions, x and x'. Let Fr^[rf) denote this joint distribution function with 



8{x') 
vz{x) 
\ v^ix') J 



(5) 



and f3 representing the component of the peculiar velocity along the observer's line of sight. 
In the distant observer approximation, the line of sight can be conveniently taken to lie 
along the Xs-axis; the redshift of a galaxy in this limit is just s{x) = x 3 + v^{x). 



5 



Since the probability of measuring a galaxy at a position r is proportional to the smooth 
underlying galaxy density, oc [1 + S[r)], the generalization of Equation 4 can be written 
down immediately ^ 

dP = n^d^sd^s' J d^rfdy {l + 6){l + 6') Frji-q) 6^ {r^-y -v' + v) , (6) 

where for convenience we henceforth adopt the abbreviated notation 6 = S[x), 6' = S[x'), 
V = v^{x), v' = v^{x'), V = v{x), and v' = v'[x'). The integral in Equation 6 averages 
the joint velocity and density distribution weighted by the probability of actually selecting 
a galaxy pair with a given redshift separation. Comparing Equations 4 and 6, we obtain 
the definition of the redshift-space correlation function which correctly accounts for any 
coupling between the density and velocity fields: 

1 + ({r.,r.) = J d'rfdy (1 + ^) (1 + 6') {r. - y - v' + v) . (7) 

Note that in the special case where the probabilities for the densities and velocities 
are truly independent, we can write Frj{rf) = Ps{8,8')Pv{v,v') where P^ and P„ are the 
separate joint distributions for the densities and velocities. In this case, the integral over 
the densities in Equation 7 just recovers the factor 1 + ^(r) which appears in Equation 4, 
i.e., 

j d8d8'{l + 8){l + 8')Ps{8,8') = l+i{r) . (8) 

3 Joint Density /Velocity Distribution in Linear The- 
ory 

We now proceed to calculate the joint distribution for the special case where the density 

field is Gaussian random field and the density and velocity fields are related by linear 

theory. In this case the joint distribution is a multi-variate Gaussian, 

careful reader will note that in addition to the delta function which imposes the correct relative sep- 
aration, s' — s, in redshift-space there should also be a delta function imposing the correct "center of mass," 
s' + s. Translational invariance, however, insures that the probability distribution, Frf{rf), depends only 
the redshift separation; the delta function for the center of mass constraint can therefore be immediately 
integrated out, giving Equation 6. 
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where C is the covariance matrix, C = [rfrj^). The task of computing F^{rf) therefore 
boils down to calculating C and its inverse; this involves computing the expected coupling 
between the density and velocity fields. 

In linear theory, it is straightforward to show that the density /velocity coupling is given 

by 

{Sv') = -r^ JdkkP{k)jr{kr) , (10) 

where f = x' — x. Symmetry demands that {6 v') be directed along the line of separation, 
f. Note also that Gaussian fields have the property that the density and velocity at a -point 
are uncorrelated, i.e., v) = 0. The bias factor appears in the formula (via j3 = 0,°'^ /b ) 
because 6 and P{k) refer to the fluctuations and power spectrum of the galaxy distribution. 

It is important to be careful about the meaning of the ensemble expectations values, 
(). The Gaussian random flelds we are considering are implicitly assumed to be ergodic, 
and hence ensemble expectation values are equivalent to spatial averages. We now wish to 



compute the average of the galaxy streaming, fi2(r") = v' — v. We must be careful, here, 
since the average in the case is not a spatial average, but a number- weighted average carried 
at over galaxy positions; since galaxies are located preferentially in high density regions [with 
an associated probability, oc (1 + ^)], these two averages need not, and generally are not, 
the same (cf., Bertschinger 1992). We can recast the number-weighted averages into the 
familiar spatial averages by explicitly including the probability of flnding a galaxy at given 
position. Thus, we write the expression for the mean streaming as 

= (v'S) — {v6') + higher order terms 

= — 2f ^— ^ J dk k P{k) ji{kr) (in linear theory) 

= ■Ui2(r)f , (11) 

where the factor (1 + ^)(1 + ^') represents the effect of number weighting. In Equation 11, 
the term {v' — v) vanishes by symmetry and higher order terms, such as {v88'), can be 



dropped in linear theory. 

In linear theory there is also a velocity /velocity coupling given by 



(12) 



where the velocity correlation functions parallel and perpendicular to the line of separation 
are (Gorski 1988) 
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kr 



kr 



(13) 



Equations 10, 12 along with the definition ^(r) = {6 6') completely specify the elements 
of the covariance matrix C. At this point we could write down C and proceed to compute 
its inverse. Computationally, it is much easier to transform to a new set of variables, rf' , in 
which the covariance matrix is nearly diagonal. Note that the actual probability remains 
invariant i.e., Fr^d'^rf = Fr^i d^rf' . A convenient set of variables are the eigenvectors of the 
individual density and velocity covariance matrices: 



V 



V v_ J 



' r+(r) ^ 

V (r) 

V -\-v 
o-|(t-) 

V -V 



(14) 



where 



2[^(0)±^(r) 



■ V- 



<±(-r*iiW±(- 

r r 



*±(r) 



(15) 



In the above equation, = + r\ with y and To- being the true separations parallel and 
perpendicular to the observer's line of sight. o"„ is the one dimensional rras velocity, 
o 1 



J dkp{k) 



3 2.^ J--- ■ 

which by Equation 13 is also given by = ^||(0) = ^_l(0). The quantity, cr_(r) is merely 
the pair- wise velocity dispersion along the line of sight. 

The covariance matrix expressed in the primed variables is quite simple. 
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(I \ 
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1 

V 1 / 



(17) 



where = ^ fi2(r)/(r^.o"_) and /Cj = ^ i'i2('")/(r_o"^.). The advantage of our variable 
transformation is that it simplifies the resulting expression for the inverse of C, 



/ \ 



v 








_K2_ 



1-K2= 1-K2 = 
1 



1-K2= 1-K2 = 









(18) 



Finally, substitution of Equation 18 into Equation 9 gives us the desired joint velocity and 
density probability distribution in linear theory, 



2^2 



exp 



(19) 



We see that in the limit of weak velocity/density coupling [k^ and K2 <C 1), the joint 
probability distribution reduces to two independent Gaussian distributions for the density 
and velocity fields. 



4 Linear Theory Limit for ^(r^,rvr) 

We can now use the linear theory probability distribution given in Equation 19 to explicitly 
evaluate the integrals in Equation 7 and thereby obtain the expression for the redshift-space 
correlation function: 



1 + C{r^,r^) = 

J d'rj' dy Fr,,{rj') 



1 + T,6, + ^{t,'6,'-TJ6J 



1 {r-K - y] 

2 aj(r) 



5^ {f-K-y - 0-+ V_ 



(20) 



where once again = + and the kernel G[y) is defined by 
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a_[r] 
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(21) 



The expression in square brackets in the first hne of Equation 20 is just the (1 + ^) (1 + 8') 
term in Equation 7 expressed in the primed variables. The expression for G{y) is obtained 
by direct integration over the variables, drf' = dS^ dS- dV^ dV_. 

To lowest order the exponential and kernel are given by 
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a_[r] 



-exp 



1 {'T-K - y) 

2 ajir) 



i 1 
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2a2 



exp 



1 {'T-K - y) 



f22l 



r a' 



where = 2a^. The redshift-space correlation function to corresponding order is 



r) + v^Jr) h I 1 
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(23) 



A similar result for the ^(r"a.,r^) in the linear regime has been derived by Regos & Szalay 
(1995). 

To evaluate Equation 23, we note that at separations large compared to the dispersion 
(r^ ^ a), the exponential will become sharply peaked at y = r^. This suggests that we 
expand ^(y) in a Taylor series about y = r^. 



(ir.,r^) = f: -^E(-\r^) /^(y-r.)"exp 



dy 



1 (^^TT - y] 

2 a2 



2 4 



S(r^) + — S(2)(r^) + — S(^)(r^) + terms of 0(^2) and higher 
Z o 



(24) 
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where S^"'^(r^) denotes the v}^ derivative of ^(y) evaluated at y = r^. The last line follows 
from the explicit form of ^(y) given in Equation 23 when terms quadratic in the power 
spectrum, i.e., cr , v'^^^r) , etc., are neglected. The last line of Equation 24 shows that the 
redshift-space correlation function in linear theory differs from the real space correlation 
function because of variations in both the mean streaming and the pair-wise dispersion 
along the line of sight, but not by higher order moments of the peculiar velocity field. 

Equation 24 is the correct redshift-space correlation function in linear theory. We can 
cast Equation 24 in the more familiar form of Equation 2 by using the definitions of the 
streaming and dispersion in terms of the power spectrum (Equations 11 and 15) and then 
directly computing the derivatives along the line of sight. The calculation is fairly laborious, 
but the end result is remarkably simple; one finds two equations relating the derivatives of 
the streaming and dispersion to moments of the power spectrum. 



5^ (-^(')) 



/3 



(25) 



Substitution of the above result into 24 indeed shows that the expressions for ^(r"a., r^) given 
in Equations 24 and 2 are equivalent. Equations 2 and 25 are the principal results of this 
Paper. 



5 Discussion 



In this Paper, we have presented an alternate derivation of the redshift-space correlation 
function in the linear regime. In contrast with previous derivations based on linear the- 
ory, we treat the density and velocity fields in a statistical manner, casting the redshift 
distortions as appropriate convolutions over the joint density and velocity probability func- 
tion. The advantage of the probabilistic derivation is that it parallels the formulation of 
the streaming model which has been frequently employed to model ^(r"a.,r^) on non-linear 
scales. The derivation shows that the failure of the usual streaming model to reproduce the 
terms of 0(/3^) in linear theory is due to its omission of a scale-dependent dispersion. In 
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linear theory, where we can neglect terms of 0[(^), we can rewrite the correct expression 
for the redshift-space correlation function (Equation 23) as 

J yZ'Ka_[r) 

Comparing this with Equation 4, we see that this is -precisely the streaming model with a 
Gaussian velocity distribution and a scale-dependent velocity dispersion. This verifies the 
claim made by Fisher et al. (1994) that the streaming model correctly reproduces linear 
theory when it generalized to include a scale-dependent velocity dispersion. 

The analysis presented suggests a way to modify the streaming model to make it ap- 
plicable in both the highly non-linear and linear regimes. On highly non-linear scales, one 
expects the velocity/density coupling will be reduced from its linear theory value, and that 
the velocity dispersion (e.g., from the virial theorem) will only be a weak function of scale 
(cf., Peebles 1980, § 76). In such a regime, the standard streaming model with an expo- 
nential velocity distribution function provides an excellent fit to the observations (Fisher et 
al. 1994). However, to preserve the correct linear theory behavior at large separations, the 
distribution function needs to tend towards a Gaussian with mean and dispersion given by 
the correct linear theory functional forms. One possibility would be to introduce a gener- 
alized distribution function with moments that are exponential with isotropic moments on 
small scales (^(r) ^1) but which tends towards a Gaussian with an anisotropic dispersion 
in the linear regime (^(r) <C 1). It is doubtful if the sampling and depth of current redshift 
surveys warrant such detailed modeling of the correlation function. With the appearance 
of much deeper surveys such as the Sloan Digital Sky survey, it will become increasingly 
important to have models for ^(r"a.,r^) which model both the non-linear and linear regimes. 
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